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1  Complexity  of  GPSG-Recognition 

Gnzdar(i981)  proposes  restricting  the  class  of  generative  grammars  for  natural 
languages  to  a  subset  of  those  with  weak  context-free  generative  power.  Since 
context-free  languages  ran  be  parsed  in  0(n3)  time,  Gor.dar  argues  this  would 
provide  “the  beginnings  of  an  explanation  for  the  obvious,  but  largely  ignored, 
fart  that  humans  process  the  utterances  they  hear  very  rapidly.” 1  In  this  paper, 
I  prove  Gaidar’s  appeal  to  general  niathematiral  results  on  context-free  parsing 
f.iils:  Generalised  Plirase  Structure  Grammar  (GPSG)  does  not  have  the  desired 
“efficient  parsibility”  result  unless  other  constraints  arc  added  to  the  theory. 
As  things  currently  stand,  GPSG,  Lexical  Functional  Grammar  (LFG),  and 
Transformational  Grammar  (TG)  all  have  a  general  “intractable  parsibility” 
result. 

A  Generalised  Phrase  Structure  Grammar  (GPSG)  consists,  at  core,  of  a 
finite  set  of  context-free  tiasc  rules  plus  a  fini  _■  set  of  metarules.  Metarules  are 
functions  from  base  rules  to  sets  of  base  rules.2  For  example,  the  metarule 

(  VP  -  V  VP)  =>  1  VP  -*  V  ADVP  VP) 

1+  »"] 

1+  •»*] 

stipulates  that  for  every  context-free  rule  expanding  a  finite  VP  as  a  finite 
euxiliary  and  a  nonfinitc  VP,  there  will  be  an  identical  rule  expanding  the  VP 
as  before,  except  with  an  adverb  between  the  auxiliary  and  nonfinite  VP. 

Unconstrained  metarule  application  may  generate  infinite  sets  of  rules  and 
describe  arbitrary  languages.3  To  preserve  both  the  context-free  weak  genera¬ 
tive  power  of  a  GPSG  grammar  and  the  supposedly  attendant  computational 
benefits,  some  formal  constraints  on  metarules  have  been  proposed  in  the  GPSG 
literature.  One  proposal  is  to  constrain  variables  in  the  metarule  pattern  to  be 
“abbreviatory  variables,”  i.c.  variables  that  can  only  stand  for  strings  in  a  finite 
and  extrinsically  determined  range.4  While  this  constraint  can  affect  the  ex¬ 
tensions!  language  of  the  grammar  in  linguistically  unmotivated  and  arbitrary 
ways,  1  adopt  the  constraint  in  my  proof  for  the  purposes  of  examining  its  com¬ 
putational  implications.  Following  most  of  the  work  in  GPSG,  I  allow  metarules 
to  apply  recursively  to  their  own  outputs,  subject  to  the  restriction  that  the  set 
of  rules  derived  from  metarule  application  is  finite.  This  constraint  preserves 
the  context-free  generative  power  of  the  formalism. 

Using  only  a  minimal  GPSG  grammar,  1  prove  that  the  recognition  problem 
for  GPSG  —  is  a  string  an  element  of  the  language  generated  by  the  gram¬ 
mar?  —  is  NP-hard.  A  problem  T[x)  is  NP-bard  if  it  is  at  least  as  hard 

’Gaidar(1081)  p.lM. 

’See  Sag  et.al.(lOM) 

3 Fur  a  discussion  of  this  and  tome  proposal*  for  restricting  metarules,  see  Shieber  et.al.(lOU) 
4See  Shieber  et.al.(lM3)  for  a  discussion  of  Gaidar's  1013  proposal. 
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computationally  w  imy  problem  in  NP:  If  we  bad  a  subroutine  that  solved  7  in 
polynomial  time,  then  we  could  write  a  program  to  solve  any  problem  in  NP  in 
polynomial  time  on  a  deterministic  Turing  machine.  Since  it  is  widely  believed 
(though  not  proved)  that  the  hardest  problems  in  NP  (NP-complctc  problems) 
cannot  be  solved  in  polynomial  time,  NP-hardness  is  considered  equivalent  to 
computational  intractability. 

To  prove  that  GPSG-Rccognition  is  NP-hard,  I  reduce  3-Satisfiability  (3- 
SAT),  an  NP-complctc  problem,  to  GPSG-Rccognition.  A  reduction  works  by 
converting  instances  of  3-SAT  into  instances  of  GPSG-Rccogntion  in  polynomial 
time.  V  we  had  a  polynomial  time  subroutine  for  solving  GPSG-Rccognitkm, 
then  we  eonld  also  solve  3-SAT  in  polynomial  time,  simply  by  converting  into  the 
GPSG-Rccognition  problem.  Since  we  know  3-Satisfiability  is  the  hardest  prob¬ 
lem  in  NP  (NP-camplete),  then  the  redaction  shows  that  GPSG-Recognitkm  is 
at  least  as  hard  as  all  problems  in  NP  (NP-hard). 

The  3- Satisfiability  problem  is:  Given  a  boolean  formula  in  conjunctive  nor¬ 
mal  form  (CNF)  where  each  clause  contains  exactly  three  variables,  is  there 
some  assignment  of  truths  to  the  formula’s  variables  that  will  make  the  formula 
true?  An  example  of  a  satisfiable  3-CNF  boolean  formula  with  five  clauses 
(where  +  indicates  disjunction)  is: 

(a +  6  +  c)(S  +  d  +  g)(t  +  J  +  $)(6  +  c  +  d)(3  +  2  +  1) 

1.1  GPSG-Recognition  is  NP-hard 

Theorem  1:  GPSG-Rccognition  is  NP-hard.  ( 

Proof:  On  input  3-CNF  formula  F  of  length  m  using  n  variables,  reduce  3-SAT 
to  GPSG-Recognition  in  polynomial  time. 

Let 

w  be  the  string  resulting  from  removing  all  parenthesis  and 

disjunct  symbols  from  F 

q  be  the  n  variables  in  some  canonical  order,  where  9,  de¬ 

notes  the  iik  variable 

Si  denote  the  i(*  symbol  in  the  string  x 

S  be  the  distinguished  start  symbol 

X ,  Y,  Z,p  be  special  symbols  denoting  members  of  {0, 1}* 

A,B,0,1,#,|  be  special  symbols 

We  construct  a  GPSG  grammar  G  s.t.  the  special  symbol  #  is  an  element 
of  L(G)  iff  F  is  satisfiable.  G  contains 

1.  n  +  1  base  rules: 
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the  i,h  rule:  [yt  — » ti>  j  1*0"  ']  where  0  <  t  <  n 


2.  the  ten  metarules 

\A  -*  to  |  XaYbZ] 

=*• 

[A  -*  w  |  z] 

==> 

\B  ~>w'  \  x] 

[5  #00 lp] 

\S  #010p] 

[S  -  #100p] 

=» 

[S  -  #011p] 

==> 

[S  -  #101p) 

=> 

(S  -  #110p] 

=> 

[S  -  #11  lp] 

= > 

\A  -  w  |  XbYaZ ] 
where  a,b  €  {0, 1} 

[B  -W  |  x ] 

where  if  u»j  =  q]  then  tw'  =  | 

[S  -  #w"] 

where  if  u»j  =  qj  then  w "  =  zy 

[S  -  #p] 

Is  “*  #P] 

(S-#p] 

\s  -*  #p] 

(5-#p) 

(5-#p1 

[S  -  #p] 


if  zy  =  0 
if  z j  =  1 


The  first  metarule,  in  conjunction  with  the  n  +  1  base  rules,  generates  all 
possible  truth  assignments  for  the  n  variables.  The  second  metarule  instantiates 
negated  variables,  and  the  third  metarule  instantiates  unnegated  variables.  The 
last  seven  metarules  determine  whether  the  assignment  that  was  guessed  satisfies 
the  formula  F.  All  variables  in  the  ten  metarules  are  either  constants  (e.g.  w) 
or  “abbreviatory  variables”  (e.g.  p)  which  stand  for  a  string  in  {0,1}*  of  length 
less  than  m.  If  these  variables  were  removed,  O(m)  new  rules  would  replace 
each  existing  rule  which  makes  use  of  an  “abbreviatory  variable.” 

The  rule  set  resulting  from  metarule  application  is  guaranteed  finite  because 
no  metarule  derives  rules  longer  than  its  input  rules.  The  reduction  can  be 
performed  in  polynomial  time  (exactly  0(mJ)  time)  because  there  are  n  base 
rules  of  length  0(n),  n  <  m,  and  only  a  constant  number  of  metarules.  Q.E.D. 

Example  reduction: 

Lfii 

F  =  (o  +  6  +  c)(o  +  6  +  c)(o  +  6  +  c)(o  +  5  +  c)(a  +  6  +  c)(S  +  6  +  Z)(I+5  +  c) 
w  =  abcabcabcabcabcabcSbc 
q  =  abe 


The  grammar  we  construct  for  F  contains 
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1.  n  +  1  lmae  rules 


[it  -  w  |  000]  [A-w\  100]  \A  -*  w  |  110]  [A  -  w  |  111] 

2.  metarule  1  generates  the  remaining  possible  truth  assignments,  deriving 
4  additional  rules 

[it-*  tv  |  010]  [it-*  tv  |  001]  [it-*  tv  1 101]  [it-*  tv  |  Oil] 

3.  metarule  2  instantiates  negated  variables,  deriving  8  additional  rules 

[B  — *  akufelaicalllfcclfclllc  |  000] 

[B  — >  afccaHaleall06c06101c  |  100] 

[B  -*  afccatlu0ca010hc06100c  |  110] 

[B  — *■  abcabOaOcaOOObcObOOOc  |  111] 

[B  — *  atcafclaOcaOlltclfcllOc  |  010] 

IB  -*  abcabOa lea  10 16c  1601  lc  ]  001] 

[B  — *  abnabOa  lea  lOOfceOiOOlc  |  101] 

[B  -*  atcafcOaOcaOOlftcltOlOc  |  Oil] 

4.  metarule  3  instantiates  unnegated  variables,  deriving  8  additional  rules 

[S  -*  #000001010011100101110] 

[S  -*  #100101110111000001010] 

[5  -»  #110111100101010011000] 

[5  -  #111110101100011010001] 

[5  -*  #010011000001110111100] 

\S  -  #001000011010101100111] 

[S  -  #101100111110001000011] 

[5  -*  #011010001000111110101] 

5.  metarules  4  through  10  cheek  if  tlie  assignment  guessed  satisfies  the  for¬ 
mula  F,  generating  28  additional  rules 
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a  -  c  »  C  V 1  HI  E  V  rm  gl  L-»  l-V  U'V  L-W  LSI  '..-VISI  LS.  I'M  IV  l  M  IV  WWW  CTJ  USI  I VT  BT  '..<V  L  V  1  %'VV  T 


\S  -*  #101110111000001010]  [S  -*  #111100101010011000] 

(5  -  #110101100011010001]  |5  -  #011000001110111100] 

[.S'  -  #000011010101100111]  [S  -  #100111110001000011] 

[S  -  #010001000111110101] 

\S  -*  #110111000001010] 

[5  -  #101100011010001] 

[S  -  #111110001000011] 

[5  -*  #111000001010] 

[S  -»  #100011010001] 

(S  -  #000111110101] 

[S  -»  #000001010] 

[S  -* #011010001] 

[5  -  #011000] 

[5  -  #000011] 

(5  -  #000] 

[s  -  #] 

Note  #  6  L(G)  because  the  rule  [S  — *  #]  was  derived,  so  the  formula  is  satisfi- 
able  (with  the  assignment  a6c  =  111,  as  the  reader  can  verily). 

1.2  Generative  Power  and  Computational  Complexity 

At  first  glance,  a  proof  that  GPSG-Recognition  is  NP-hard  appears  to  contradict 
the  context-free  generative  power  result  noted  above.  After  all,  there  exist  a 
wide  range  of  algorithms  capable  of  recognising  context-free  languages  in  time 
0(n»). 

The  connection  between  weak  generative  power  and  efficient  parsibOity  is 
not  as  direct  as  was  evidently  assumed.  The  crux  of  the  matter  is  that  a  GPSG 
grammar  M  can  result  in  a  context-free  rule  set  M'  of  any  sise  whatsoever, 
subject  only  to  the  restriction  that  it  be  finite.  M'  can  be  of  sise  0(2*)  whet* 
|Af|=  Ik,  or  even  significantly  larger.  Context-free  parsers  like  the  Earley  algo¬ 
rithm  actually  run  in  time  0(|G|*  n3)  where  |G|  is  the  context-free  grammar 
sise  and  n  the  input  length,  so  the  hypothetical  GPSG  grammar  M  will  be  rec¬ 
ognised  in  time  0(4*n3).  The  exponential  term  will  clearly  dominate  the  Earley 
algorithm  complexity  in  the  reduction  above  because  |Af  j  is  a  function  of  the 
length  m  of  the  input  formula  F.  Even  if  the  GPSG  grammar  is  held  constant, 
the  exponential  increase  in  derived  grammar  sise  will  result  in  an  astronomical 


[S  -  #100101010011000] 
[.S'  -  #000001110111100] 
[5  -»  #001000111110101] 

[5  -  #101010011000] 

[5  -  #110001000011] 

[S  -*  #010011000] 

[S  -*  #001000011] 

\S  -  #010001] 

\s  -  #001] 
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const  fin  t  factor,  which  in  turn  will  dominate  the  real-world  performance  of  the 
Earley  algorithm  for  all  expected  inputs  (i.e.  those  of  a  million  words  or  less). 


1.3  Restricting  Metarule  Application 

Since  the  central  problem  is  that  a  metarule  is  capable  of  deriving  any  finitely 
large  set  of  rules,  including  exponentially  large  ones,  we  must  further  constrain 
metarule  application  if  we  wish  to  solve  the  GPSG-Rccognition  problem  in  poly¬ 
nomial  time  (i.e.  obtain  an  efficient  parsibility  result). 

Let  us  imagine  severely  restricting  the  derivational  power  of  metarules  as 
follows:* 

1.  a  metarule  may  only  operate  once  in  the  derivation  of  a  given  rule  (i.e. 
not  recursively  on  its  own  output). 

2.  a  metarule  is  a  function  from  one  context-free  rule  to  another  context-free 
rule  (and  not  from  a  rule  to  a  set  of  rules). 

3.  all  metarule  variables  either  are  constants  (e.g.  w)  or  stand  for  a  single 
symbol  and  can  only  have  two  possible  values  (e.g.  x  can  only  stand 
for  the  symbols  0,1).  Note  that  this  is  more  restrictive  than  Gaidar’s 
abbreviatory  variables  constraint. 

4.  no  unrecoverable  deletion  occurs  in  the  derivation  of  a  given  context-free 
rule. 

5.  no  two  metarules  or  basic  rules  are  identical  either  in  pattern  or  function. 

Even  in  such  a  severely  restricted  system,  GPSG-Rccognition  will  be  NP- 
hard.  An  0(m *)  time  reduction  is  included  in  appendix  A.  Since  each  metarule 
application  can  double  the  siie  of  the  grammar  G,  and  there  arc  now  0(m*) 
metarules,  the  resulting  grammar  will  still  be  siie  0(|  G  |  -2m  ),  or  still  ex¬ 
ponentially  larger  than  G,  and  the  Earley  algorithm  recognition  time  will  be 
0((|G|-2m  )*m»)  ==  0(4m,m9). 

A  list  of  restrictions  necessary  to  remove  GPSG-Rccognition  from  the  class 
NP-hard 


5  A*  far  as  I  know,  theie  three  reitrictione  together  go  beyond  anything  diacoaeed  in  the 
openly  available  GPSG  literature. 

'in  order  to  guarantee  that  these  four  reitrictione  are  sufficient,  GPSG  muit  be  completely 
and  exactly  formally  specified,  in  a  manner  which  ensures  that  proliferation  of  categoric* 
will  not  make  the  recognition  problem  intractable.  Another  aspect  of  current  GPSG  for¬ 
mulations  which  make  them  NP-hard  —  and  probably  intractable  —  is  the  Immediate 
Dominance /Linear  Precedence  (ID/LP)  formalism.  Sec  Dartnn(1084)  for  a  proof. 

Note  that  the  linguistically  untenable  restriction  of  prohibiting  metarule  variables  of  any 
kind  i*  probably  sufficient,  when  coupled  with  ID/LP  restrictions,  to  guarantee  polynomial 
time  recognition.  Such  a  restriction  would  mean  that  a  metarule,  which  may  only  “match" 
one  baric  rule,  can  only  derive  exactly  one  rule.  The  rite  of  the  derived  context-free  rule 


1.  strictly  bounded  ‘‘chaining’1  only  a  constant  number  of  metarules,  fixed 
in  advance  for  all  GPSG  griunumrs,  can  o]>crate  in  the  derivation  of  a  given 
context-free  role. 

2.  each  metarule  may  derive  a  rule  set  only  polynomially  bigger  than  its 
input  rule  set. 

3.  a  metarule  may  only  use  “abbreviatory  variables.’’ 

4.  metarules  are  unable  to  apply  to  other  metarules. 

The  simplest  effective  restriction,  and  one  that  is  not  much  different  from  the 
list  of  restrictions  above,  is  to  remove  metarules  from  GPSG  altogether.  Since 
metarules  can  only  derive  polynomially  large  rule  sets  and  limited  chaining 
will  make  metarule  interaction  very  unpredictable,  the  linguist  would  be  better 
off  writing  out  the  context-free  role  set  in  advance,  and  ignoring  metarules 
altogether. 

On  the  other  hand,  eliminating  metarules  poses  its  own  problems,  typical 
GPSG  systems  result  in  very  large  sets  of  derived  rules  —  literally  trillions 
of  rules  —  and  writing  out  these  context-free  rule  sets  in  advance  would  not 
be  feasible.7  Furthermore,  a  context-free  grammar  lacking  metarules  foils  to 
capture  linguistically  significant  relations  between  rules.  This  significant  (and 
apparently  unrcaolvable)  metarule-inclusion  issue  plagues  us  only  if  we  maintain 
the  “context-free  weak  generative  power”  framework.  The  obvious  next  step  « 
to  abandon  that  framework. 

1.4  Conclusion 

The  moral  of  my  proof  is  that  as  far  as  we  know,  it  is  not  possible  to  appeal 
to  general  mathematical  results  to  rescue  “efficient  parsibility”  results.  Specific 
constraints  on  the  particular  representations  postulated  by  linguistic  theory  are 
needed  to  explain  efficient  parsibility.  This  docs  not  imply  that  GPSG  theory  is 
without  merit:  on  the  contrary,  I  have  merely  shown  that  its  particular  efficient 
parsibilty  thesis  cannot  be  maintained.  Generalised  Phrase  Structure  Grammar, 
Lexical  Functional  Grammar,  and  Transformational  Grammar  are  all  probably 
intractable  in  an  attract  mathematical  sense,  and  each  theory  must  search 
elsewhere  for  an  explanation  of  efficient  parsibility,  if  one  is  to  be  given  at  all.1 

set  would  be  the  site  of  the  basic  rule  set  plus  the  number  of  metarule*.  This  ratfktfcw 
is  linguistically  unmotivated  because  it  fails  to  capture  linguistically  important  general! ra¬ 
tions.  For  example,  any  metarule  applying  to  singular  and  plural  sentences  would  have  to 
be  replicated  at  least  twice:  once  to  handle  the  singular  case,  and  ooce  to  handle  the  plural 
ease. 

7 See  Shieber(lMS)  p.lS7 

'See  Berwick  and  Weinberg(lM4)  for  a  discussion  at  LPG  complexity. 


The  first  order  of  business  is  to  discover  an  explanatory  computational  theory 
of  natural  language,  which  identifies  the  goal  of  the  computation,  explains  why  it 
is  appropriate,  and  describes  the  logic  of  the  strategy  by  which  it  can  be  carried 
out.  Once  we  develop  the  computational  theory  and  more  clearly  understand  the 
butguage  faculty,  then  we  can  devise  algorithms  and  even  postulate  neurological 
mechanisms  wliich  implement  the  algorithms.  No  magical  mathematical  result 
is  likely  to  rescue  us  from  this  hard  work. 

Metagrammatical  devices  should,  I  believe,  be  avoided  in  principle  for  both 
linguistic  and  computational  reasons.  Such  devices  try  to  describe  some  reg¬ 
ularity  or  generality  not  expressible  in  the  core  grammar.  A  more  powerful 
approach  would  discover  a  representation  or  principle  that  explains  the  given 
phenomenon,  rather  than  merely  describing  it.  Instead  of  resorting  to  stipu- 
latory  language  descriptions,  linguists  should  search  for  grammatical  principles 
and  representations  that  combine  in  an  explanatory  theory. 
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A  Reduction  from  3-SAT  to  GPSG-Recogr  ;tion 

The  reduction  makes  use  of  metarule  schemata  solely  to  increase  clarity  d  ex¬ 
position. 

Theorem  2:  GPSG-Rccognition  is  NP-hard,  even  when  severely  constrained. 


Proof:  On  input  3-CNF  formula  F  of  length  m  using  n  variables,  reduce  3-SAT 
to  GPSG-Rccognition  in  polynomial  time. 


Let 


9 

x« 

S 

X,Y,Z,p 
A,B,0, 1,#,  | 


be  the  string  resulting  from  removing  all  parenthesis  and 
disjunct  symbols  from  F 

be  the  n  variables  in  some  canonical  order,  where  4,  de¬ 
notes  the  ilh  variable 
denote  the  string  in  {y}*  of  length  t 
denote  the  ith  symbol  in  the  string  * 
be  the  distinguished  start  symbol 
be  special  symbols  denoting  members  of  {0, 1}* 
be  special  symbols 


As  above,  wc  construct  a  GPSG  grammar  G  s.t.  the  special  symbol  #  is  an 
element  of  L(G)  iff  F  is  satisfiable.  G  contains 


1.  n  +  1  base  rules: 

the  ith  rule:  [A  -»  w  |  l'O"-*]  where  0  <  i  <  n 

2.  the  metarule* 

(a)  (n  +  l)2  metarules,  to  generate  all  possible  truth  assignments,  de¬ 
scribed  by  the  schema 

[A  -*  to  |  x'ax*bxk]  =>  [A  — ►  so  |  xibx*axk) 

where  a,b,x  €  {0, 1},  0  <  i,j  <  n,  and  k  =  n  —  1  —  jf  —  2. 

(b)  two  metarules,  to  instantiate  negated  and  unnegated  variables 

[A  -»  to  |  x\  =>  [B  -» to'  |  x ] 

where  if  w,  =  47  then  »,'  =  |  J  I  J 

[B  -  to'  |  *]  =»  [S  -*  #10") 

where  if  toj  =  q,  then  to('  =  Xj 
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(c)  7 n  metarules,  to  check  that  the  guessed  assigmcnt  satisfies  F,  de¬ 
scribed  by  the  7  schemata 


(S  #001z“] 

=> 

\S  -  #*“] 

[S  -  #010*“] 

=► 

[S  -  #***] 

[S  -4  #100*“] 

=> 

|S  -  #*sk] 

[S  -»  #011*“] 

=* 

[5  #*“] 

[S  -  #101*“] 

==> 

(5  #*“] 

\S  #110*“] 

=> 

[S  -  #**‘] 

(5  -  #111*“] 

==> 

[S  -  #*“] 

where  0  <  k  <  n  -  1 

The  result  of  applying  the  two  “instantiating”  metarules  or  any  metarule 
described  by  the  last  metarule  schemata  is  to  change  the  basic  rule’s  righthand 
side  so  that  the  metarule  will  not  apply  to  it  again. 

1  now  prove  that  all  possible  truth  assignments  can  be  generated  by  the  first 
metarule  schema  above,  subject  to  the  restriction  that  a  metarule  may  only 
operate  once  in  the  derivation  of  a  given  rule.  This  is  equivalent  to  proving  that 
we  can  generate  all  binary  numbers  from  0  to  2"  -  1  inclusive,  using  only  the 
n  +  1  binary  numbers 


where  0  < »'  <  » 

and  (n  +  1)J  metarules  described  by  the  schema 

\z'ax*bxk}  =>  [z'6zJ’a*k] 

where  a,b,z  €  {0,1},  0  <  s',  j  <  n,  and  k  =  n-  i-  j~  2.  These  metarules 
perform  the  action  of  exchanging  any  two  given  bit  positions  in  a  binary  number 
of  length  n. 

Let 

x  be  a  binary  number  with  k  l'a  in  its  binary  representation, 

0  <  *  <  2" 

y  =  l*Q"~k  be  the  k*  binary  number 

The  algorithm  found  in  figure  1,  expressed  in  a  generic  programming  lan¬ 
guage,  derives  x  from  y  using  the  metarules.  No  metarule  can  be  applied  twice 
because  i  and  j  are  different  everytime  a  metarule  is  applied  (in  line  6).  In  any 
given  derivation,  clearly  at  most  f  j]  metarules  are  applied  (see  line  1),  and 
exactly  mmfk,  n-k)  metarules  arc  applied  in  any  given  derivation.  Q.E.D. 


w 
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Figure  1:  Algorithm  to  derive  x  from  y  using  metarules. 


:-3 


procedure  =  derive  (x,y) 

1:  for  i  =  1  to  f 

2:  if  /  Vi  then  do 

3:  for  j  =  n  to  0  step  -1 

4:  if  Xj  /  yj  then  goto  6 

5:  next  j 

6:  apply  metarule  [xt~iaxt~i~lbxn~>]  to  y, 

switching  bit  positions  t  and  j  in  the  number  y 

7:  next  i 


Example  derivation: 

Ls! 

x  =  0010011110  be  a  binary  number  with  5  l's  in  its  Unary  representation, 
y  =  1111100000  be  the  5th  binary  number 


y 

1111100000 

0111100010 

0011100110 

0010101110 


meUryfr  MKd 

[axxxzxxxbx]  =>  [bxxxxxxxax] 
[xu xxxxxbxx]  =>  [xbxxxxxaxx] 
[xxxaxxbxxx]  =>  [xxxbxxaxxx] 
[xxxxnbxxxx]  =>  [xzzxbazxxz] 


next  y 

0111100010 

0011100110 

0010101110 

0010011110 


